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Abstract 


SDD-1,  A  System  for  Distributed  Databases,  is  currently 
being  designed  and  implemented  by  Computer  Corporation  of 
America.  SDD-1  is  composed  of  a  collection  of  datamodules 
which  may  be  dispersed  geographically  and  which  are 
assumed  to  communicate  over  channels  which  may  vary  in 
bandwidth  and  delay.  Also,  the  system  supports  redundant 
databases,  meaning  that  portions  of  databases may  be\ 
stored  at  two  or  more  datamodules  in  order  to  improve  the 
reliability  and  efficiency  of  database  operations. 

This  paper  presents  a  brief  overview  of  several  key  facets 
of  SDD-1.  These  include: 

-  the  system  architecture, 

-  the  data  distribution  concepts  used  by  the  system, 

-  SDD-1 ' s  approach  to  directory  management, 

-  the  method  employed  for  efficiently  handling 
updates  to  redundantly  stored  data,  and 

-  SDD-1 '  s  method  for  efficiently  retrieving  data 
which  is  dispersed  at  distant  datamodules. 
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1 .  Introduction 

SDD-1  is  a  distributed  database  system  being  designed  and 
implemented  by  Computer  Corporation  of  America  (CCA)  in  a 
project  sponsored  by  the  Advanced  Research  Projects  Agency 
(ARPA)  of  the  Department  of  Defense.  Work  on  the  system 
is  currently  in  progress.  This  paper  describes  the 
system’s  preliminary  design. 

SDD-1  is  designed  to  support  databases  distributed  world¬ 
wide  over  hundreds  of  database  sites.  The  database  sites 
are  assumed  to  communicate  over  heterogeneous  communica¬ 
tion  channels  which  may  vary  in  bandwidth  and  delay,  and 
it  is  not  assumed  that  all  sites  are  able  to  maintain  con¬ 
tinuous  communication  with  each  other.  Also,  SDD-1  is 
designed  to  support  databases  which  are  stored  redundant¬ 
ly.  meaning  that  some  or  all  logical  data  items  can  be 
stored  at  multiple  database  sites  in  order  to  enhance  the 
reliability  and  responsiveness  of  the  system. 

The  objectives  which  SDD-1  is  intended  to  achieve  are  the 
following: 
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1.  reliability/survivability  -  the  system  must  con¬ 
tinue  operation  despite  the  failure  or  inaccessi¬ 
bility  of  one  or  more  of  its  database  sites. 

2.  "tunable”  efficiency  -  it  must  be  possible  to  dis¬ 
tribute  data  in  such  a  manner  that  Dortions  which 
are  heavily  used  in  a  given  geographical  region  can 
be  stored  near  that  region. 

3.  modular  upward  scaling  -  as  databases  increase  in 
size  and  usage,  it  must  be  possible  to  augment 
system  capacity  to  accommodate  these  increases  by 
the  incremental  addition  of  new  database  sites. 

It  is  important  to  note  the  key  role  which  data  redundancy 
plays  in  achieving  these  objectives.  Reliability  and  sur¬ 
vivability  are  enhanced  since  SDD-1  can  continue  to  access 
critical  data  items  even  if  some  of  the  database  sites  at 
which  the  data  are  stored  become  inaccessible.  Efficiency 
is  enhanced  since  data  items  which  are  accessed  by  widely 
separated  user  communities  can  be  stored  nearby  each  of 
the  communities.  Redundancy  also  enhances  scalability 
since  growth  in  database  usage  can  be  accommodated  by  in¬ 
creasing  the  number  of  data  copies  rather  than  by  increas¬ 
ing  the  speed  of  memory  units. 
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On  the  other  hand,  data  redundancy  introduces  severe 
problems  in  performing  updates  in  a  consistent  manner. 
The  approach  taken  by  SDD-1  in  handling  these  redundant 
update  problems  is  addressed  in  Section  4  of  this  paper 
and  in  [ROTHNIE  and  GOODMAN],  [ROTHNIE  et  al],  and 
[BERNSTEIN]. 

Other  distributed  database  system  designs  (e.g.,  [ALSBERG 
and  DAY],  [ELLIS],  and  [THOMAS-b])  have  permitted  redun¬ 
dantly  stored  data,  but  have  required  that  the  entire  da¬ 
tabase  be  stored  at  every  database  site.  This  restriction 
is  unrealistic  for  large  databases  and  effectively  pre¬ 
cludes  "tunable"  efficiency  and  upward  scalability. 

This  paper  summarizes  certain  key  characteristics  of  the 
SDD-1  design.  Section  2  describes  the  architecture  of  the 
system  and  presents  the  framework  used  by  SDD-1  for 
defining  the  distribution  and  redundancy  of  logical  data¬ 
bases.  Sections  3-5  then  describe  the  manner  in  which 
SDD-1  deals  with  key  technical  difficulties  in  distributed 
database  management:  Section  3  discusses  the  management 
of  directory  information;  Section  4  explains  the  approach 
taken  in  SDD-1  to  the  problem  of  updating  redundantly 
stored  data;  and  Section  5  presents  SDD-1's  approach  to 
efficiently  processing  retrievals  involving  dispersed 


data. 
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2.  Architecture  and  Distribution  Concepts 


2.1  Global  System  Architecture 

Figure  2.1  illustrates  the  general  architecture  of  SDD-1. 
The  system  is  composed  of  a  set  of  components  called  data- 
modules  which  communicate  over  assorted  communication 
channels.  The  datamodules  are  all  functionally  identical; 
that  is,  they  respond  identically  to  external  stimuli. 
However,  there  may  be  a  great  variety  of  datamodule  imple¬ 
mentations.  For  example,  some  datamodules  may  support 
mass  memories  and  operate  as  major  data  repositories  in 
the  net  while  others  may  be  small  systems  with  little 
storage,  operating  as  data  caches  for  quick  response  to 
nearby  users. 

The  distribution  and  redundancy  of  data  in  SDD-1  is  invis¬ 
ible  to  users.  A  user  connecting  to  anv  datamodule  is 
presented  with  the  illusion  that  a  complete  and  non- 
redundant  database  is  resident  at  that  datamodule.  In  re¬ 
sponse  to  users'  queries  SDD-1  takes  responsibility  for 
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General  System  Architecture  of  SDD-1  Figure  2.1 


SDD-1  GENITAL  SYSTEM  DESCRIPTION 


•  SDD-1  COMPRISED  OF  DISTRIBUTED  DATAMODULES. 

•  Distribution  is  invisible  to  users. 


•  Data  is  stored  redundantly. 
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locating  the  desired  data  among  the  distributed  network  of 
datamodules  and  for  updating  all  copies  of  changed  data 
items;  ideally  users  are  able  to  interact  with  the  dis¬ 
tributed  system  as  easily  as  with  a  conventional,  central¬ 
ized  one. 

The  objective  of  relieving  users  of  the  need  to  be  aware 

of  distribution  issues  is  similar  to  goals  pursued  in  the 
distributed  operating  systems  being  developed  today,  e.g. 
by  the  National  Software  Works  (cf  [SCHANTZ  and 
MILLSTEIN]),  the  National  Bureau  of  Standards  NAM  project 
(cf  [ROSENTHAL]),  and  the  Arpanet  RSEXEC  project  (cf 
[THOMAS-a]).  These  projects  take  a  collective  computing 
resource  distributed  geographically  on  a  computer  network 
and  make  it  available  to  users  in  an  integrated,  easy  to 
use  manner.  This  approach  in  the  distributed  database 
area  has  also  been  discussed  by  [ALSBERG  et  al],  [ALSBERG 
and  DAY],  [ELLIS],  [STONEBRAKER  and  NEUHOLD] ,  and 
[THOMAS-b]. 
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2.2  Datamodule  Architecture 


This  section  examines  the  internal  architecture  of  the 
SDD-1  datamodules.  The  datamodule  architecture  is  illus¬ 
trated  in  Figure  2.2.  As  the  figure  indicates,  each  data¬ 
module  consists  of  two  internal  modules  called  the  global 
data  manager  (GDM)  and  the  local  data  manager  (LDM)  re¬ 
spectively. 

The  LDM  manages  data  located  at  the  datamodule  of  which  it 
is  a  part.  It  has  no  awareness  of  data  distribution 
issues  whatsoever.  The  LDM  can  applv  data  management  op¬ 
erators  to  its  locally  stored  data,  and  it  can  retrieve 
and  modify  locally  stored  data.  The  LDM  performs  these 
functions  in  response  to  requests  from  GDM's. 

GDM's,  on  the  other  hand,  have  no  local  storage  at  all  and 
depend  entirely  on  the  LDM's  for  storage  resources.  Their 
role  is  to  handle  all  the  data  distribution  issues  in  SDD- 
1.  Specifically,  the  functions  of  a  GDM  are  the  follow¬ 


ing 
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Datamodule  Architecture  Ficure  2.2 


DM. 


DM  2 


DM3 


The  GDM  parses  a  user  request  into  an  internal  form 
which  exposes  what  operations  are  to  be  performed 
and  what  data  is  to  be  operated  upon. 

The  GDM  determines  which  datamodules  house  data  in¬ 
volved  in  the  requests.  This  determination 
involves  access  to  directory  information  which  is 
managed  as  we  describe  in  Section  3. 

The  GDM  decides  what  operations  should  be  performed 
by  LDM's  in  the  network.  The  decision  process 


takes  two  factors  into  account:  one  is  access  ef- 
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ficiency  (see  Section  5  and  [WONG])  and  the  other 
is  database  consistency  (see  Section  4,  [ROTHNiE  et 
al]  and  [BERNSTEIN]). 

A  key  motivation  for  selecting  this  GDM/LDM  structure  for 
the  datamodule  architecture  is  the  long  run  goal  of  em¬ 
ploying  a  variety  of  existing  database  management  systems 
in  the  role  of  LDM.  The  LDM  operations  have  been  consci¬ 
ously  confined  to  the  level  of  function  currently  provided 
by  typical  database  management  systems.  Py  equipDing 
GDM's  with  the  rules  for  constructing  their  Drimitive  LDM 
calls  in  the  language  of  the  target  LDM,  a  heterogeneous 
distributed  database  system  can  be  achieved. 

In  the  initial  SDD-1  implementation  all  LDM's  will  be 
Datacomputers  (cf  [MARILL  and  STERN]).  Since  the  Dataoom- 
Duter  was  designed  and  implemented  as  a  database  manage¬ 
ment  system  without  any  regard  for  its  future  role  as  an 
LDM,  the  feasibility  of  using  the  Datacomputer  in  this  way 
suggests  the  plausibility  of  using  other  DBMS's  in  this 
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2.3  Data  Distribution  Concents 


The  logical  data  model  supported  by  SDD-1  is  relational. 
In  this  section  we  consider  the  stored  representation  of  a 
set  of  relations  in  SDD-1 . 

The  assignment  of  logical  data  items  to  the  physical 
storage  resources  of  the  datamodules  begins  with  the  par¬ 
titioning  of  each  relation  into  sub-sets  called  fragments. 
Each  fragment  is  defined  to  be  a  rectangular  sub-set  of  a 
relation;  i.e.  fragments  are  defined  as  restrictions  and 
projections  of  database  relations.  Furthermore,  restric¬ 
tions  are  constrained  to  involve  simple  predicates  as  de¬ 
fined  by  [ESWARAN  et  al],  i.e.,  boolean  conditions  of  the 
form: 


Attribute  Relational  operator>  constant 

where  Relational  operator>:=  "r"  ,"<"  ,">"  ,  or 


Also,  to  avoid  updating  anomalies  similar  to  those  de¬ 
scribed  by  [CHAMBERLIN  et  al]  with  resoect  to  views,  each 
projection  is  required  to  include  the  primary  key. 
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Figure  2.3  illustrates  the  partitioning  of  a  PERSONNEL 
logical  relation  into  fragments . • 

Fragments  then,  are  the  units  of  assignment  of  logical 
data  items  to  datamodules .  A  given  fragment  is,  by  defi¬ 
nition,  either  entirely  Dresent  or  entirely  absent  at  each 
datamodule,  However,  each  fragment  may  be  stored  re¬ 
dundantly  at  more  than  one  module.  The  stored  reoresenta- 
tion  of  a  fragment  in  a  given  module  is  termed  a  stored 
fragment  and  is  designated  Stored-F^  m  meaning  the  reore- 
sentation  of  fragment  in  datamodule  m.  Figure  2.U  il¬ 
lustrates  the  partition  of  a  'logical  database  into  frag¬ 
ments  and  the  assignment  of  fragments  to  modules.  Each 
arc  from  the  rectangles  (representing  fragments)  to  the 
circles  ( representing  datamodules)  corresponds  to  a  stored 
fragment . 

Since  fragments  may  be  stored  redundantlv,  in  general 
there  will  be  more  than  one  way  of  reconstructing  a  com¬ 
plete  and  non-redundant  copy  of  the  logical  database  from 


*  A  wav  of  achieving  this  reauirement  without  impacting 
users  is  to  include  in  every  relation  an  attribute  called 
TID  (for  tuple  ID)  which  can  be  used  by  the  system  inter¬ 
nally  but  is  not  visible  to  users.  INGRES  [STONEBRAKER  et 
al]  and  System  R  [ASTRAHAN  et  al]  use  the  TID  concept  for 
other  purposes.  Like  those  uses,  though,  we  assume  that 
TID  is  guaranteed  to  be  unioue  for  each  tuple.  In  this 
way,  the  primary  key  reauired  in  each  fragment-defining 
projection  can  simply  be  TID. 
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Partition  of  a  Relation  into  Fragments  Figure  2.3 


Each  fragment  is  defined  as  follows: 


-  PERSONNEL.  := 

PERSONNEL  where  Salary  >  $30,000, 
projected  on  Name,  Age,  Position,  TID 

-  PERSONNELp  := 

PERSONNEL  where  Salary  >  $30,000, 
projected  on  Supervisor,  Department,  TID 

-  PERSONNEL,  := 

3  PERSONNEL  where  Salary  <=  $30,000, 
projected  on  Name,  Age,  Position, 
Supervisor,  Department,  TID 

-  personnel,.  := 

PERSONNEL  projected  on  Salary, 
Years-of-service,  TID 
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Assignment  of  Fragments  to  Datamodules  Figure  2.U 


the  collection  of  stored  fragments.  For  instance,  in  the 
example  of  Figures  2.3  and  2.M,  there  are  four  ways  of  re¬ 
constructing  a  complete,  non-redundant  cooy  of  the  logical 


PERSONNEL  relation: 
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1 .  One  which  consists  of 

Stored-PERSONNEL,  .  Stored-PERSONNEL,  Stored- 

1  i  nl  c  flu 

PERSONNEL3  n,  and  Stored-PERSONNEL,,  Q; 

2.  Another  consisting  of 

Stored-PERSONNEL,  Stored-PERSONNEL,  Stored- 

PERSONNEL^  Q,  and  Stored-PERSONNEL,,  Q. 

3.  A  third  consisting  of 

Stored-PERSONNEL,  .  Stored-PERSONNEL,  .  Stored- 

i ,  m  d ,  o 

PERSONNEL3  n,  and  Stored-PERSONNEL,,  Q;  and 

4.  Finally,  one  consisting  of 

Stored-PERSONNEL,  Stored-PERSONNEL,  Stored- 

I  , 01  <:,0 

PERSONNEL3  o>  and  Stored-PERSONNEL,,  Q . 

Similarly  there  are  six  ways  of  reconstructing  the  INVEN¬ 
TORY  relation  in  that  example. 

A  collection  of  stored  fragments  which  form  a  complete  and 
non-redundant  copy  of  the  logical  database  is  called  a  ma¬ 
terialization.  At  any  given  time  SDD-1  recognizes  a  spe¬ 
cific  set  of  materializations  as  "supported".  A  user 
logging  in  to  SDD-1  is  given  access  to  the  database 
through  a  specific  supported  materialization  and  repeated 
accesses  to  the  system  will  result  in  assignment  to  the 
same  supported  materialization. 
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The  make-up  of  each  supported  materialization  is  recorded 
in  a  table  such  as  the  one  illustrated  in  Figure  2.5.  The 
management  of  this  and  other  directory  information  will  be 
described  in  Section  3.  Another  table  indicates  the  ma¬ 
terialization  to  which  each  user  is  assigned.  SDD-1  will 
service  a  user’s  retrieval  requests  by  accessing  only  the 
stored  fragments  listed  for  his  assigned  materialization. 

The  concept  of  supported  materializations  (hereafter 
called  simply  "materializations")  is  an  important  element 
of  SDD-1 ’ s  data  distribution  model.  Since  each  material¬ 
ization  is  a  complete  and  non-redundant  copy  of  the  data¬ 
base,  it  is  analogous  to  the  simpler  concept  of  logical 
relations  in  the  non-distributed  database  setting.  The 
materialization  concept  plays  a  central  role  in  the  notion 
of  database  consistency  in  SDD-1  and  has  a  major  impact  on 
the  problem  of  updating  redundantly  stored  data.  This 
issue  is  addressed  in  Section  4, 
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Table  of  Fragment  Assignments  Figure  2.4 
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Table  entry  is  datamodule  from  which  the  indicated  materi¬ 
alization  obtains  each  fragment.  Note  that  the  table 
shows  supported  materializations  only  and  does  not  show 
all  possible  materializations. 
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3.  Directorv  Management 


One  issue  which  is  freauentlv  cited  (e.g.,  in  [FRY  and 
SIBLEY],  [SIBLEY],  [STONgBRAKER  and  NEUHOLh] )  as  signifi¬ 
cant  in  the  design  of  distributed  database  systems  is  the 
management  of  the  global  system  directory.  This  directory 
provides  the  information  which  the  RDM’s  reauire  to  parse 
user  reauests,  to  locate  data  of  interest,  and  to  choose 
accessing  and  uDdating  strategies. 

Alternatives  for  handling  the  directorv  include: 

-  storing  it  in  a  single,  central  datamodule; 
storing  a  complete  copy  at  each  datamodule; 
dispersing  the  directory  over  the  set  of  datamod- 
ules  non-redundantly;  and 

dispersing  the  directory  over  the  set  of  datamod- 
ules  with  some  redundancy. 

The  optimal  choice  among  these  alternatives  depends  upon 
the  pattern  of  transaction  traffic  in  the  network. 
Ideally,  general  purpose  distributed  database  systems 
would  not  adopt  any  of  these  alternatives  directly  but 
rather  would  permit  the  choice  to  be  made  as  part  of  data¬ 
base  design. 
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SDD-1  achieves  this  flexibility  bv  treating  the  directory 
as  ordinary  user  data.  The  data  which  comorise  the  direc¬ 
tory  are  partitioned  into  fragments  just  as  all  other  user 
data  are,  and  like  user  data,  these  fragments  are  stored 
in  a  distributed  and  redundant  manner  throughout  the 
system.  In  this  way  arbitrary  levels  of  redundancy  and 
distribution  of  the  directory  can  be  supported.  It  should 
be  noted  that  treating  the  directory  as  user  data  also 
makes  other  system  features  such  as  security,  integrity, 
and  concurrency  control  available  for  directory  manage¬ 
ment. 

There  is  one  regard  in  which  directory  information  cannot 
be  treated  identically  to  user  data  and  that  is  in  the 
matter  of  where  the  directory  information  for  the  direc¬ 
tory  itself  is  stored.  It  is  necessary  to  treat  this  in¬ 
formation  specially  in  order  to  provide  the  system  with  a 
known  starting  point  from  which  directory  information  can 
be  found. 

This  problem  is  handled  in  SDD-1  by  factoring  the  direc¬ 
tory  information  for  directories  into  separate  relations 
which  are  stored  in  every  datamodule.  The  decision  to 
store  this  level  of  directory  information  at  all  datamod- 
ules  is  based  on  the  assumption  that  these  relations  are 
small  and  very  retrieval  intensive. 
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4.  Efficient  Redundant  Updating 


In  develooing  a  distributed  database  system  such  as  SDD-1 , 
one  of  the  key  problems  is  to  update  the  multiple  copies 
of  a  single  logical  data  item  in  such  a  way  that  consis¬ 
tency  of  the  database  is  preserved  and  excessive  synchro¬ 
nization  overhead  is  avoided.  The  SDD-1  approach  to  this 
problem  is  treated  in  detail  in  [ROTHNIF  et  al]  and 
[BERNSTEIN],  In  this  section  we  outline  certain  key 
aspects  of  the  technique. 

The  notion  of  database  consistency  is  defined  for  SDD-1  in 
terms  of  materializations.  Since  each  (supported)  materi¬ 
alization  represents  a  non-redundant  logical  view  of  the 
database  as  it  might  be  seen  by  a  user,  a  strong  form  of 
consistency  must  be  preserved  within  each  materialization. 
Indeed  the  internal  consistency  of  each  materialization 
must  be  maintained  as  stronely  as  the  internal  consistency 
of  a  conventional,  centralized  database  system,  i.e: 
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We  first  assume  that  each  transaction  always  maos  a 
consistent  database  state  into  another  consistent 
state*.  Then,  internal  consistency  can  be  defined 
in  terms  of  seriali2abilitv  (cf  [ESWAPAN  et  al], 
[GRAY  et  al],  and  [HEWITT]).  Serializabilitv 
dictates  that  the  effect  of  a  set  of  concurrently 
executing  transactions  on  a  database  oust  be  eauiv- 
alent  to  the  effect  of  some  serial,  non-overlapoing 
sequence  of  those  same  transactions.  Then,  since 
each  transaction  running  separately  does  not 
violate  database  consistency,  the  effect  of  the 
concurrent  collection  cannot  be  inconsistent. 

This  form  of  consistency  is  quite  strong  and  it  is  expen¬ 
sive  to  preserve  in  terms  of  communication  costs  and  pro¬ 
cessing  delays. 

Fortunately  it  is  sufficient  to  maintain  a  much  weaker 
form  of  consistency  between  materializations.  All  that  ’s 
required  for  mutual  consistency  between  materializations 
is  that  the  database  cooies  they  represent  not  diverge 
over  time.  It  is  not  necessary  for  the  cooies  to  be  in- 


•  Verifying  that  separate  transactions  will  not  violate 
consistency  is  an  integrity  checking  step  of  the  sort  de¬ 
scribed  by  (HAMMER  and  MCLEOD],  It  will  not  be  considered 
here. 
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stantaneously  identical  at  all  times;  it  is  sufficient 
that  they  would  attain  the  same  final  state  were  all 
update  activity  to  cease. 

The  simplest  method  for  controlling  redundant  uDdates  is 
to  lock  those  portions  of  the  database  being  read  or  writ¬ 
ten  by  active  transactions.  However,  in  a  distributed  da¬ 
tabase  environment  an  aooreciable  and  often  intolerable 
delav  is  introduced  while  locking  information  is  orooa- 
gated  to  the  many  comDuters  in  the  database  network. 

More  efficient  methods  for  locking  in  a  distributed  data¬ 
base  system  have  been  proposed  by  [THOMAS-b]  and  by 
[ALSBERG  and  DAY],  The  method  proposed  by  [THOMAS-b] 
reduces  the  volume  of  inter-module  communications  needed 
for  locking  bv  utilizing  a  voting  protocol;  in  this 
method  one  need  only  communicate  with  a  majority  of  data- 
modules  rather  than  with  all. 

The  [ALSBERG  and  DAY]  method  is  ouite  different  and  intro¬ 
duces  the  notion  of  a  "primary  site"  for  updating.  In 
this  approach  all  update  activity  in  the  distributed  data¬ 
base  system  must  be  handled  through  a  single  primary 
update  site,  *nd  all  locking  occurs  locally  within  that 
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Although  the  improvements  suggested  bv  these  authors  may 
be  appropriate  in  certain  contexts,  both  are  infeasible  in 
a  general  distributed  system  like  SDD-1 . 

The  method  used  in  SDD-1  differs  qualitatively  from  this 
previous  work.  Rather  than  merely  trying  to  improve  the 
efficiency  of  global  database  locking  the  SDD-1  method 
seeks  to  avoid  global  locking  whenever  possible.  While  it 
is  true  that  in  general,  arbitrary  transactions  must  place 
locks  throughout  the  database  network,  there  are  many 
cases  in  which  locking  need  only  occur  in  the  datamodule 
where  the  transaction  was  initiated.  In  these  cases,  the 
proDagation  of  the  update  information  to  all  the  other  re¬ 
dundant  copies  of  the  data  is  carried  out  by  an  efficient 
protocol  which  again  only  requires  locking  on  a  module  by 
module  basis. 

Thus  this  uDdating  method  permits  SDD-1  to  exhibit  the 
same  fast  response  for  most  update  transactions  that  it 

i 

can  offer  on  retrieval.  For  any  transaction  the  auestion 
of  whether  it  may  be  run  without  global  locking  depends  on 
what  other  transactions  are  to  be  run  in  that  manner. 
This  decision  is  made  by  the  database  administrator  as 
part  of  the  database  design  activity. 

The  database  administrator  defines  classes  of  transactions 
that  are  commonly  executed  in  the  database  application  and 
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for  each  class  he  specifies  which  datamodules  are  expected 
to  enter  the  transactions.  Then  by  applying  the  process 
presented  in  [ROTHNIE  et  al]  and  [BERNSTEIN],  the  specif¬ 
ied  configuration  of  transactions  is  analyzed  to  determine 
the  amount  of  synchronization  that  is  reauired  for  trans¬ 
actions  in  each  class. 

A  table  representing  the  output  of  the  analysis  is  used  by 
each  datamodule  at  run-time  to  determine  the  appropriate 
synchronization  level  reouired  by  a  sriven  transaction.  It 
is  important  to  recognize  that  this  predefinition  activity 
in  no  way  limits  the  transactions  which  can  be  processed 
by  the  system.  It  merely  permits  more  efficient  execution 
of  those  classes  which  have  been  anticipated. 
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5.  Efficient  Dispersed  Data  Access 

The  distribution  of  data  in  systems  like  SDD-1  represents 
a  potential  problem  in  responding  rapidly  to  complex 
queries.  The  execution  of  a  user  reouest  involving  cross- 
referencing  operations  (e.g.,  joins)  over  relations  at 
several  sites  can  incur  a  very  substantial  delay.  This 
delay  is  caused  by  the  need  to  bring  together  at  one  pro¬ 
cessor  all  the  data  required  for  the  execution  of  the 
cross-referencing  operation. 

The  approach  adopted  in  SDD-1  for  handling  this  pt*  il-lem  is 
described  in  detail  in  [WONG],  The  approach  is  similar  in 
a  formal  sense  to  the  decomposition  strategy  proposed  by 
[WONG  and  YOUSSEFI]  for  optimizing  Queries  in  a  non- 
distributed  database  setting.  The  algorithm  in  the  dis¬ 
tributed  environment  operates  by  decomposing  a  multi- 
datamodule  query  into  a  sequence  of  local  aueries  involv¬ 
ing  data  at  only  one  datamodule,  with  inter-module  data 
transfers  between  the  local  queries.  The  execution  strat¬ 
egies  which  are  generated  by  this  technique  consist  of  a 
sequence  of  steps,  each  of  which  is  either 
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a.  a  set  of  site-to-site  data  moves  or 

b.  local  processing  on  data  which  has  been  moved  to  a 
single  site. 

The  sequence  of  steps  selected  is  determined  bv  a  step-by- 
step  optimization  process  which  can  be  represented  by  a 
binary  decision  tree. 

A  key  assumption  of  the  optimization  technique  is  that 
communication  costs  will  dominate  the  processing  of 
complex  queries  in  a  distributed  environment.  Hence  at 
each  step  the  optimization  procedure  first  attempts  to 
minimize  communication  costs  and  then,  within  the  optimal 
communication  strategy,  attempts  to  minimize  local  pro¬ 
cessing  . 
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6 .  Summary 

SDD-1  is  a  distributed  database  system  in  the  earlv  stages 
of  implementation  which  is  characterized  by  the  following 
key  concepts: 

1.  It  is  implemented  as  a  set  of  communicating  pro¬ 
cessors,  called  datamodules,  which  are  disDersed 
worldwide  and  which  are  envisioned  to  number  in  the 
hundreds  for  a  single  system. 

2.  The  distributed  implementation  is  invisible  to 
users. 

3.  SDD-1  supports  arbitrary  redundancy.  The  physical 
storage  of  data  items  is  defined  in  terms  of 
logical  subrelations  called  fragments. 

4.  Each  datamodule  is  implemented  as  two  communicating 
processes,  called  the  global  data  manager  and  the 
local  data  manager.  The  partition  of  function 
between  these  two  components  permits  the  eventual 
incorporation  of  existing  database  management 


systems  into  a  heterogeneous  SDD-1  . 
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5.  Directory  information  is  stored  in  SDD-1  as  ordi¬ 
nary  user  data.  This  oermits  choices  regarding  the 
distribution  and  redundancy  of  the  directory  to  be 
delayed  until  database  design. 

6.  UDdating  multi-cooy  data  items  is  a  Drocess  which 
can  represent  an  intolerable  bottleneck  in  distrib¬ 
uted  systems.  SDD-1  employs  a  transaction  classi¬ 
fication  technique  which  avoids  time-consuming 
inter-module  synchronization  for  most  update 
transactions . 

7 .  The  system  uses  an  access  planning  technique  which 
attempts  to  minimize  communication  costs  as  its 
primary  optimization  goal  and  local  processing 
costs  as  a  secondary  goal. 
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